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1.0 GENERAL INFORMATION 



l.l Physical requirements 

SORT is designed to operate with a Datapoint 2200 model 
I26t 16k version 2, and a 2200 series 350 disk with one to 
four drives. It is necessary to have Datapoint DOS resident 
and SORT cataloged within it. 

SORT will optimize its speed through allocation of its 
working files on the available drives. During this process 
it attempts to ascertain the availablili ty of sufficient 
disk space to achieve the desired sort. The program will 
abort at this point should the disk space be inadequate. 



1.2 Bringing up the system 

The two programs comprising sort are distributed on a 
cassette in the 'CMF' format meaning that they are cataloged 
on a CTOS tape. This is only a means of distribution and the 
cassette Tape operating System has no other connection with 
the sort programs . 

To incorporate the two programs from the CTOS tape into 
the Disk operating System directory* the following procedure 
proves least time consuming: By hand'*' advance the cassette 
beyond the loader (about 6 inches of actual tape). Bring up 
the DOS. Place the cassette in the front deck. Type to the 
DOS m 2.Q.&1LQ.&Q.JL1LZ . This will bring in the main SORT 
program. If no error messages are issued* proceed with the 
next step. Type U. iP,il/2.iijLil • SORT is now ready to 
operate. (Of course* the above procedure could be replaced 
with the usual 'OUT» the CTOS file and 'IN' the disk file, 
but it takes much longer). 



2.0 SIMPLE SORT CONCEPTS 



2.1 what the files look like 

All Datapoint systeas use a universal text file 
structure - Databus, Datashare RPG II, Basic* scribe. 
Editor* Assembler* Terminal emulators* etc. Therefore* any 
file generated by or for any of the above may be sorted. It 
must be on disk* however. 
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when sorting* consider that the result of the sort is 

not restructuring of the original file. It is a 

restructured COPY of the original file. The original file is 

never changed. 

Therefore* SORT produces a file which is a sorted 
version of the original. This gives the user the added 
opportunity of specifying the type of file to be the output 
regardless of the input file format (with one restriction 
see section 3.4) . 



2.Z The key options 




Consider the following record 
for reference only) : 



(column count scale below 



Mule, Francis A. 242219 123 BARN SAN ANTONIO TX 
12 3456789012 3456789012 34 56789012 3456789012 3456789012 34567890 



The name begins in column 1 and goes to 22. The employee 
number spans columns 24-29. The street address is 31-42. The 



city is 43-59. The State is 59-60. 

If each person had a record in the file exactly in the 
above format, SORT could order the sequence of records in 
the file by any of the above fields. For instance, to get 
an alphabetical list of the records by name, the KEY would 
be l to 22 (hereafter referred to as 1-22). The KEY for 
sequencing the file in order of employee number would be 
24-29. The key for ordering the records by state then city 
and then employee nunber would be 59-60*43-59, 24-29. 

It should be obvious that any part of the record can be 
used as a key. It may not be obvious, however, that the 
larger the key, the slower the sort - it is the case and is 
approximately proportional. 



2.3 How to sort a file 

sorting a file is done from the keyboard of the DOS. 
All the operator must know is the NAME of the file to be 
sorted, the naaedesired for the sorted output file, and the 
definition of the KEY. 

For instance, the keyboard issued command for the above 
example to sort on the name field (1-22), would be: 

This is assuming that the name of that file was 
EMPLFILE. It is also the operators decision as to what the 
resultant sorted file is called. The command could have 
been: &aai lH£Ji£lL£ A £li21&lllzZ2 , naming the resultant file 
EMPSRT. The second file named is where the resultant sort 
will be placed. 

More complicated keys may be stated as well, and the 
command to list by state and then name would be: 

This is simplified sorting. Most systems have source 
code for a Data.bus.or Assembly language program on file on 
the disk, such programs can be sorted, for instance, by 
label field and produce an interesting look at the 
dictionary: Ml l£L£iL£xMl£lIIilz6 . 

Sorting an assembly or Databus source file by op-code 
can provide an interesting analysis of the usage of each 
instruction type: 

mi MiiiiixMiiiitiiiria. 

Sort can provide an interesting, informative analytical 
tool to many other systems than simply the typical business 



requirements . These other aspects of SORT'S utility become 
increasingly significant when a sort function is as easily 
invokable and speedy as the Datapoint SORT system. 



3.0 THE OTHER OPTIONS 



3.1 Generalized command statement format 

The following is a generalized statement format for the 
Datapoint DOS SORT: 

SORT INPILE,OUTFILE<:DRk><; {<*>< IXDX L> 1 <S S S<-EEE>>> 

Information contained within a pair of corner brackets 
<> are optional .Default conditions are listed below. The 
items within square brackets () are not order dependent. 
Typical statements obeying this format are: 

(1) SORT INFILE,OUTFILE 

(2) SORT INFILE.OUTFILE; 1-3,7-20 

(3) SORT INFILE,O0TFILEJIDl-3 

(4) SORT INFILE,0UTFIL£**IDL7-20 

All the above statements will invoke a sort. Each will 
provide different results. However, notice that in (l) 
there are no other parameters than the file specifiers. 
That is because all the specifiable parameters have a given 
value in case there is no specif if ication for it. 

The following list defines the parameters which can be 
specified: 



<:DRk> 



This specifies the drive for the sort key 
file. This is only a working scratch file 
needed during the sort. It will go, in 
default, on drive in a one-drive system, 
and drive l in a multi-drive system. 
Removing it from the same drive as the input 
file will speed up the sort. 



<*> 



..This will cause a debug display of various 
internal parameters to occur during the sort 
and is only useful in helping systems 
programmers understand a difficult- 
to-explain behavior on the part of SORT. If 
'.« is not present, no debug display will 
occur. 



<I> 



Without typing the I, the output file will 

be SEQUENTIAL no matter what the input file, 

only if the input file is an INDEXED file, 

can the I parameter be included and cause 
the output file to be indexed. 



<D> Without typing the D, the collating sequence 



order is considered ASCENDING, 
the D parameter will cause the 
sequence to operate in DESCENDING 



Including 
collating 
order. 



<L> 



Normally the sort transfers the entire 
records of the input file to the output 
file. It is possible, not only to transfer 
part of each recordt but to include constant 
literals in each record as well. Including 
the L paraieter in the list of paraneters 
will cause another question to be asked 
wherein you nay specify the limitations and 
constants. 



<SSS<-EEE>> 



This is the sort key specification. It lay 

be repeated with comma separators. If no 

key is specified, the system will assume the 

identical sort to that which would occur if 

the operator typed 1-10, i.e. the first ten 

characters. The maximum number of keys is 



that which can be 
the input line. 



typed without exceeding 



3.2 Keys -overlap ping and in backwards order 

The key specification does not need to be forward only. 
A specification of 17-12 will cause the 6 delimited 
characters to be a key but in the order of 
17,16,15,14,13,12. This is extremely valuable, clearly, in 
data which has the most significant digit or character last. 

Key specifications may also be overlapping: 1-20,30-15 

overlaps 15 to 20. When this occurs, the system will 

optimize the sort and save time over re-sorting on those 
columns again • 



3.3 Ascending and Descending sequences 

Changing the collating sequence from ascending to 



descending is the same 
the last first etc. 
ascending sequence on 
Should it be sorted in 
would be first and Mr. 
collation follows the 



as •reversing* the file, or placing 
sorting a telephone directory in 
name produces the familiar order, 
descending sequence, then Mr. zyk 
Aardvark would be last. The 
ASCII order when alphabetic, 



and punctuation 
together . 



characters can all occur 



in 



order of 
numeric , 
a column 



3.4 Input/output file format options 



SORT accesses each file sequentially. Due to the 
techniques used in the Datapoint standard file structure! 
the sequential reading technique will provide sort with all 
of the records in the file whether the file was originally 
indexed or sequential. Therefore* the file fornat options 
only allow specification of the OUTPUT file's fornat* 

If the input file is INDEXED, that is one logical 
record or string per physical disk record* then you have a 
choice of output formats. If 'I' is chosen* that is indexed* 
then each output disk record will contain an exact copy of 
the appropriate input file record. If 'S* is chosen* that is 
SEQUENTIAL* then the input file* reordered* will be 
reblocked* space compression inposed* and appear* generally 
such lore compactly* in the output file in sequential 
format. 

If the input file is SEQUENTIAL in its original format, 
then there is only one choice for the output format. The 
output file format for a sort on an input file which is 
sequential MUST be SEQUENTIAL. 



3.5 Limited output format option 
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To invoke the 
operator includes the 
(see section 3.1). If 



limited output format option, the 
•L' parameter in the specifier list. 
and. only if the L is specified during 



the SORT call* will there be 
operator on the next line: 



a second question asked of the 



LIMITED OUTPUT FILE FORMAT: 

This question requires at least one non-trivial field 
specification or constant (see next paragraph). The number 
of field and constant specifications is only limited by that 
which can fit on the keyed in line. 

To permit even more utility in report generation, SORT 
allows inclusion of constants in the output record that did 
not occur in the input record. For instance* assume that the 
personnel data base was a full record of approximately 240 
characters and that the employees name appears in columns 80 
to no and his telephone number was in columns 171 to 180. 
To make a telephone directory in alphabetical order, one 
could answer the following to the limited file output format 
request : 



80-110, • 



• , 171-180 



Note that this would put out the name followed by four 



spaces, a hyphen, four more spaces and the 
number of input file fields and constants can 
the output file up to the limit of the line 
specification is typed. 



number . Any 
be placed in 
on which the 



Also note that the output file requires proportionally 
less room than the input file when limited. Often this fact 
can be applied when the disk file space is nearly exhausted 
and a sort is required. 



3.6 Key file drive number. 

There are three file systems associated with a sort. 
The first is, of course, the input file. The second is the 
output file. The third is the keyfile system. (The user only 
uses the output file ■- the keyfile system is a scratch file 
used by the system during sorting). There are 
files which are opened during the sort for 
system. They are *SORTKEY/SYS and *SORTMRG/SYS 
files can grow to considerable sizes during 



procedure since they are proportional 
records and the site of the key field. 



actually two 
the keyf ile 
These two 
the sort ing 
to the number of 



There are two considerations for the location of the 
keyfile system. The first is the problem of room. The 
keyfile must be on a drive with sufficient room to hold it. 
The second is speed. The greatest increase in speed occurs 
in removing the keyfile system from the same drive as the 
input file. Greater speeds can occur if it is, as well, not 



10 



on the saae drive as the output file. Normally the SORT 
does a good job of determining the best location of the two 
keyfile files, and it should not be necessary to specify 
anything for this. However* under complex circumstances, it 
nay be desirable for the operator to specify the drive 
nusber for the keyfile. Should this be the case* the user 
should type in the <:DRk> specification as indicated in the 
general connand foriat in section 3.1. 
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4.0 THE USE OF CHAIN WITH SORT 



The reader should first familiariie himself 
thoroughly reading the CHAIN User's Guide. 



with CHAIN by 



Chain is a system whereby the operator of a Datapoint 
Disk operating System may pre-define a procedure sequence of 
his own programst system commands and utilities (including 
keyboard answers to questions requested by these programs) 
and have them called and sequentially executed by a single 
name. This is especially powerful when using SORT since 
there may be a repetitive sequence of routines with complex 
pararaeterizat ions which would make good use of a 
simplification. 



4.1 How to set up a chain file for sort 

The author of a chain file needs to remember that ALL 
questions that the system requests INCLUDING those initiated 
by the executing programs MUST BE answered from the chain 
file just as though they would be typed in from the 
keyboard. 
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4.2 Naming a repetitive sort procedure 

Frequently there are sorts and printouts and other 
proceedures which occur together and for which a name 
invoking the procedure would be a great simplification. 

For instance, in the telephone directory example above, 
the process of sorting the file into a limited output file 
and then listing it on a local printer could be procedurized 
as follows: 
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SORT EMPFILE,TELFILE;L80-110 

80-110** - », 171-180 

LIST TELFILEfXL 

TELEPHONE DIRECTORY FOR XXXXXXXXXX 
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4.3 Initiating a sort froa another prograa 

The chain file (CHAINFIL above) could have been created 
by any Datapoint system which can write a file. This Bakes 
the concept even aore powerful since prograBS can create or 
Bodify subsequent procedures of itself* other programs* 
systea coBBands and utilities. RPG II and Databus 7 can make 
good use of this. 



4.4 Using chain to cause a serge 



consider a situation wherein a systea has a aaster file 
called *MASTER' and a file of records to be added* in 
sequence* to the aaster file called 'ADOFILE'. 
these two files in sorted sequence at the end of 
would normally require a sequence of keyed in 
which are soaewhat coaplicated and error prone, 
cause an effective MERGE and assign it a single 
f ol lows : 



To merge 

each day 

operations 

chain can 

name as 



SAPP MASTER, ADDFILE, MASTER 
SORT MASTER, SCRATCH. 1-20 
KILL MASTER/TXT 
NAME SCRATCH/TXT, MASTER/TXT 

Note that the procedure: 

1) appends the ADDFILE to the MASTER file. 

2) Sorts the extended MASTER file into a SCRATCH file. 

3£4) Renaaes the SCRATCH file as the new MASTER file. Thus* 
it is apparent that a merge can be effectively achieved 
using SORT by using chain to pre-define the procedure. 
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